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This paper describes the dsp, a recently developed integrated 
circuit implementing a programmable digital signal processor. The 
single-chip device is fabricated in depletion- load nmos and is pack- 
aged in a 40-pin dip. It has the speed, precision, and flexibility for a 
variety of telecommunication applications. The processor can decode 
an instruction, fetch data, perform a 16- by 20- bit multiplication and 
a full 36-bit product accumulation in one machine cycle of 800 ns. 
This permits the realization of signal processing functions of such 
applications as dual-tone multifrequency receivers or low-speed data 
modems with a single device. The arithmetic precision of the proces- 
sor is also sufficient for many voice signal applications. 

I. INTRODUCTION 

Digital signal processing has become more and more important in 
telecommunications. As new products and services are offered, the 
amount of required signal processing continues to increase. In addition, 
signals are becoming digital, especially in applications where the 
superior stability and accuracy of these signals is either necessary or 
more attractive. Digital signal processing is also prompted by the 
introduction of digital switching offices and digital transmission sys- 
tems. It is made possible by the continuous, rapid growth of the silicon 
lsi and vlsi capabilities. The latter have made it inexpensive to build 
complex processors — so inexpensive that it is cost-effective even to use 
a/d conversion and digital signal processing in some analog systems. 
We indeed visualize the extension of the digital network all the way to 
the subscriber phone! 
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In this paper, we describe a single-chip, digital signal processor 
recently developed for Bell System use. The device, known as Digital 
Signal Processor (dsp) is a general-purpose building block which can 
be programmed to perform a variety of digital signal processing func- 
tions. Examples of these are filtering, equalization, modulation, tone 
detection, speech coding, and Fast Fourier Transform. The dsp is 
fabricated in depletion-load nmos and packaged in a 40-pin dip. It is 
customized to perform specific signal processing functions by means of 
an on-chip read only memory (rom) containing the program and fixed 
data. The device also contains a random access memory (ram) for 
writing and reading variable data, a Control Unit, an Arithmetic Unit 
(au), an Address Arithmetic Unit (aau), and appropriate Input/Out- 
put (i/o) circuitry. The dsp functions in a stand-alone manner, requir- 
ing only an external 5-MHz resonator or clock, or it may be directly 
interfaced with other processors to achieve a greater degree of signal 
processing capability. 

The dsp programmability makes the device useful for a variety of 
telecommunication applications, and results in a shorter and less 
expensive system development cycle. Key elements in digital signal 
processing are adequate numerical precision and high- computation 
rates. The dsp offers both. Its 16- by 20-bit multiplier and 40-bit adder, 
running at 1.25 million operations per second, are unparalleled in other 
lsi processors. 

The general dsp architecture is described in Section II. Section III 
centers on the dsp programming and includes a brief description of 
the instruction set. An example of a simple program is also given to 
illustrate the style of the input language. In Section IV, the dsp i/o 
interface is described. Finally, an overview of the dsp performance in 
typical filtering applications is given in Section V. 



II. ARCHITECTURE 

This section presents a description of the dsp architecture. As shown 
in Fig. 1, the principal features are as follows: 

(i) a 1024-word by 16-bit rom for instructions and fixed data; 
(ii) a 128- word by 20-bit ram for variable data; 
(Hi) an aau which generates addresses for the rom and ram mem- 
ories and performs post modification arithmetic on these addresses; 

(iv) an au which accepts a 16-bit and a 20-bit operand to form a 
36-bit product, accumulates the product with a 40-bit accumulator, 
and rounds the accumulator to a 20-bit word (with overflow protection) 
for storage or output; 

(v) an i/o section which serially receives and transmits either /x- 
255 law or linear pcm signal samples; and 
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(vi) a Control Unit for instruction decoding and overall system 
coordination. 

The dsp is also able to access a 1024-word by 16-bit external ROM, with 
no reduction in processing speed. This feature is especially convenient 
during program development and testing. It is also useful for small 
volume applications in which the expense of programming the on-chip 
rom is not justifiable. 

The analysis of many digital signal processing algorithms reveals 
that they basically perform multiplications and additions. Therefore, 
the au was designed to implement these operations efficiently. In its 
simplest form, the expression evaluated by the au is 

x>y + a— > a 

where, 

x is the 16-bit coefficient in register X, and 

y is the 20-bit data word in register Y. 

Again, the word lengths for x and y were determined by examining the 
requirements of a variety of telecommunication applications. A good 
compromise was established between the hardware required to imple- 
ment a given precision and the need for a general part, like the dsp, to 
cover most applications. 

The 36-bit product, p, is summed with the 36 least-significant bits of 
the contents, a, of the 40-bit accumulator, a, and the result is written 
into a. When the value in a is needed outside the au (e.g., to write to 
memory), the contents of a are truncated or rounded, overflow cor- 
rected (if necessary), then stored in the 20-bit au output register, w. 
The contents of w can then be transferred to other parts of the dsp via 
the 20-bit data bus, or can be used as data for another arithmetic 
operation. 

The au is pipelined in three stages: (i) the formation of the product 
x.y, (ii) the addition of the product to a, and (iii) the transfer of a to w. 
Thus, while this transfer is in progress for any one expression, the 
addition of the product in P to the contents of a for the next expression 
is also being performed, and the formation of the product x.y of the 
following expression is taking place. This pipelined structure keeps all 
parts of the au busy at all times and allows the processor to maintain 
a high throughput. 

The full capability of the au is described by the more general 
operation 
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where 

x = 16-bit coefficient which is read into register x from the 16 
most-significant bits of the 20-bit data bus. This coefficient is 
normally fetched from ROM, but could also be fetched from 
ram or the input buffer. 

y = 20-bit data word which is fetched from ram or the input 
buffer, and is read into register y from the data bus. The 
contents of y can also be written to the data bus. 

a = contents of the 40-bit accumulator a. The four extra bits are 
provided for overflow protection. 

w = rounded or truncated 20-bit au output word which is stored 
in register w. The contents of w can be written to the data 
bus for storage in ram or for output through the output buffer, 
or can be used instead of y in another arithmetic operation. 
The least-significant bit of w corresponds to bit 14 of a. This 
selection is consistent with the assumption that y and w are 
integers and that x is usually restricted by — 2 < x < 2. 
However, other choices are possible by shifting a before 
reading it. 

/ = linear or nonlinear function of either y or w, such as the actual 
value, the absolute value, or the sign function (signum) of one 
of these variables. 

f a = arithmetic function of a (e.g., scaling of a by 2 or 8) or a logical 
function of a and p{p and a). 

The 16-bit processor instructions are stored in the ROM. When 
coefficients are fixed, they will also reside in ROM. Data for the 
algorithm, whether it comes from the input or is generated by the 
algorithm, may be stored in the 20-bit-wide ram. In some applications 
(e.g., adaptive filters as required in echo cancelers) the coefficients are 
variable and are stored in the 16 most-significant bits of ram locations. 

Addresses for memory references are generated in the aau. Four 
memory addresses, required to access the instruction, the coefficient, 
and the data (both read and write), are multiplexed onto the 10-bit 
address bus in each processor cycle, and the corresponding information 
is multiplexed onto the 20-bit data bus. The program in ROM is accessed 
by the address stored in register PC, the program counter. Fixed 
coefficients in ROM can also be addressed by PC. Alternatively, coeffi- 
cients can also be addressed by the auxiliary register rx, which can 
point to either ROM or ram. Data, which is read from ram, can be 
addressed by ry or by an auxiliary register rya, while data can be 
written to ram by using addresses in rd or rda. The primary use of 
the auxiliary registers rya and rda is to allow manipulation of tem- 
porary results in a separate section of data memory. 
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The aau also provides a selection of possible increments for post- 
modification of these addresses. Under the direction of a given instruc- 
tion, the contents of the address registers are applied to the address 
bus and then incremented in the aau adder before being restored to 
the register, ready for subsequent use. The program and coefficients 
can be structured in ROM so that the contents of PC need only be 
incremented by +1. The contents of other address registers can be 
incremented by the amounts 0, +1, or —1, or by the contents of the 8- 
bit registers I, J, or k, as specified by the instruction. The program 
return register (pr) shown in Fig. 1, is used to provide a single level 
subroutine capability. The lc register is a 6-bit loop counter used to 
provide looping within an algorithm. All the registers mentioned above 
can be set to arbitrary values. This can be done unconditionally or 
subject to a particular condition being met. 

Instructions from ROM are latched into the instruction register ir 
and subsequently decoded in the Control Unit. In some auxiliary 
instructions, e.g., a register set from ROM or a register load from ram, 
a 16-bit argument follows the instruction; this argument goes to register 
xs or ys, respectively. The decoded signals are transferred from the 
Control Unit to the au, the aau, the i/o, and to the various registers, 
as needed. Arithmetic control information that is relatively invariant 
within an algorithm (e.g., the type of rounding arithmetic used in 
moving data from a to w, or the built-in scale factor used in some 
multiplier operations) is stored in the auc register. The ioc register 
stores a similar type of information for the i/o (e.g., the i/o rate, or 
the size and format of the input and output data words). 

The data interaction between the dsp and the outside world is 
carried out through the i/o structure. Inputs and outputs are processed 
through the buffer registers ibuf and obuf, respectively. The i/o 
interface accommodates a serial transfer of 8-, 16-, or 20-bit words 
under the control of either the dsp or a variety of external devices 
(e.g., codecs, microprocessors). Different i/o rates and formats are 
available to the programmer to facilitate this interfacing. Additional 
details will be given in Section IV. 

The setting (under program control) of register syc allows the user 
to suspend the dsp operation until a condition specified by one of the 
fields of this register is met. This can be used to synchronize the 
processor program with the data sample rate. The available conditions 
are input buffer full, output buffer empty, or the status of one of the 
two dedicated logical inputs cO and cl. A control input, cst, can be 
used to latch internally the values of cO and cl. Similarly, the setting 
of register str allows the user to output directly one or two logical 
signals (sO and si) and/or a synchronization pulse (stb). 

The serial i/o and its control require another ten pins; they are 
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described in Section IV. Sixteen of the 40 pins (dbs0-dbs15) are 
dedicated to the external data bus, which is used to access external 
rom in place of the internal, mask-programmable ROM. The remaining 
eight pins are used as follows: 

(i) two for the +5 V power supply (vcc) and ground (vss); 
(ii) three for the crystal connection (xtal), the clock input (clkin), 
and the clock output (clkout); 

(iii) one for resetting the dsp to a starting point (rest); and 
(iv) two for external memory control (exm and exe). 
The external rom is accessed by setting exm low; exe combined with 
clkout allows the generation of signals needed to latch the address 
coming out of the dsp through the DBS pins, latch the data fetched 
from the external ROM, and enable these data onto the DBS pins. 

III. PROGRAMMING 

The dsp has two types of instructions: normal and auxiliary. The 
normal instructions control processor computations in the au to eval- 
uate the general expression given in the last section. The three au 
operations of product formation, accumulation, and transfer to the au 
output register w (if required), are fully completed in one cycle of the 
processor. The operations are performed in parallel, each one corre- 
sponding to the partial evaluation of a different expression. 

For a normal instruction, a typical symbolic assembler input line 
consists of up to four expressions indicating 

(i) the source and destination of the data to be transferred out of 
the au, with the destination address register increment, 

(ii) the control of the au output register contents, 

(iii) the function to be performed by the accumulator, and 

(iv) the product to be formed by the multiplier, including a specifi- 
cation of the operand address registers and increments. 
When true program constants are used for product operands they may 
be indicated directly in the expression rather than indirectly through 
an address register. 

At the machine level, the 16-bit instruction has fields that control 
the above-mentioned functions, including the information needed to 
read the coefficient and data required in a later au operation, and to 
write the result of a previous au operation. Constants to be loaded 
into the x register are also 16 bits wide and are stored in rom following 
the corresponding instruction. 

Auxiliary instructions are used to control noncomputational aspects 
of the dsp, such as initialization of address registers and conditional 
inhibition of certain processor functions. They can also specify an 
additional set of computational operations for the au, such as com- 

ARCHITECTURE AND PERFORMANCE 1455 



/t-LINEAR 



a12 



,Z20 






^20 / 


\\b2^ 




a2l/ 




z- 1 








Z2\ I 










z 


-1 











Z12 Z22 

Fig. 2 — Fourth-order recursive filter. 



pressed/linear conversions or large shifts of the accumulator contents, 
which do not require the general argument flexibility available in 
normal computations. The assembler input for these instructions in- 
dicates in a simple format the special functions that they specify, as 
can be seen, e.g., in the register set instruction of the example below. 

At the machine level, a 16-bit auxiliary instruction is always followed 
by a 16-bit argument which is interpreted either as an extension of the 
instruction itself or as data associated with the instruction. Both 
normal and auxiliary instructions have common fields that allow 
writing of previous results or fetching of information required for later 
operations. 

Many features of the dsp are illustrated in a simple example of a 
fourth-order recursive filter shown schematically in Fig. 2 and in the 
assembler input code below. The filter has a ju-law input from the input 
buffer, two five-multiply second-order modules, and a linear output to 
the output buffer. The program begins at line 1 with a series of 
auxiliary instructions for initializing the dsp. The first seven instruc- 
tions are unconditional register set operations. The constants ioc and 
auc, to be written into the corresponding registers, reflect the desired 
options for i/o and au operations. The increment registers I, J, and k 
are set to 1, —1, and —3, respectively. Registers ry and rd are set to 0, 
the address of the first ram location. The constant syc, to be written 
into the respective register, reflects the desired condition for suspend- 
ing the dsp operation. 

Assembler input code for fourth-order recursive filter 

1: ioc = ioc; 

2: auc = auc; 

3: i = 1; 

4: j = -1; 
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5 








k = -3; 




6 








ry = 0; 




7 








rd = 0; 




8 


loop: 






syc = syc; 




9 








a = p + a 


p = mtll (ibufy); 


10 




w 


= a 


a = p 


p = mtl2 (); 


11 


obuf = w 






a = p + a 


p = 612**ry++i; 


12 








a = p + a 


p = 611**ry++j; 


13 








a = p + a 


p = al2**ry++i; 


14 


*rd++i = y 


w 


= a 


a = p 


p = all**ry++i; 


15 


*rd++i = w 






a = p + a 


p = al0*w; 


16 








a = p + a 


p = 622**ry++i; 


17 








a = p + a 


p = 621**ry++j; 


18 








a = p + a 


p = a22**ry++i; 


19 








pc = & loop; 




20 


*rd++i = y 


w 


= a 


a = p 


p = a21**ry++k; 


21 


*rd++k = w 






a = p + a 


p = a20*w; 



The instruction in line 8 is the first instruction in the main operating 
loop of the program. (This loop processes each sample through the 
filter.) Its function is to suspend the processor until the selected 
external synchronizing event occurs. This is the method used in this 
example for synchronization with an external sample rate clock. Lines 
9 and 10 are auxiliary instructions which perform the /x-law to linear 
conversion. This conversion is done on data which was fetched from 
ibuf. The accumulation, transfer to w, and write to obuf in lines 9, 
10, and 11 refer to the operations that were begun at the end of the 
loop. The practice of meshing the tail of the loop with its head is 
essential for writing low overhead code for this pipelined machine. 

The ram memory organization for this program is shown below: 



Location 


Variable 





Z12 


1 


Zll 


2 


Z22 


3 


Z21 



where the Zs are the state variables shown in Fig. 2. The values in 
registers i and J are used to modify the addresses in registers ry and 
rd so that these variable locations may be referenced. The k register 
resets these addresses after they are used for the last time in the loop 
with no additional overhead. The filter coefficients (612, 611, • • • , a21, 
a20) are stored in-line with the code. 

The instruction that sets the PC for the end-of-loop branch is at line 
19 instead of at the actual end of the loop. This is because of the 
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pipelined architecture. When the machine is executing the branch 
instruction at line 19, it is already decoding the instruction at line 20 
and is fetching the instruction at line 21. Therefore, the next instruction 
to be fetched will be at line 8. 

In this example, there are only two dsp cycles of overhead in the 
loop (the setting of syc and pc). The total loop has 14 cycles and could 
accommodate a sample rate of up to 89 kHz. 

IV. INPUT/OUTPUT INTERFACE 

The dsp architecture is designed to facilitate system interface with 
a minimum number of external components, if any. The i/o transfer 
of information is performed serially. The dsp i/o structure provides 
serial-to-parallel conversion of input data, and parallel-to-serial con- 
version of output data. Input and output operations are carried out in 
independent sections, thus, permitting them to be asynchronous with 
respect to each other, as well as with respect to the program execution. 
Several signals control the i/o transmissions. 

Five dsp pins are dedicated to the input serial transfer and its 
control, and five pins are dedicated to the output. The beginning of a 
serial transfer is indicated by a synchronization signal present at the 
isy (input synchronization) pin for an input, or at the osy (output 
synchronization) pin for an output. Input data bits are received at pin 
di and advanced into the ibuf register of the dsp by the clock signal 
present at pin ick (input clock). Output data bits are available at pin 
do and are shifted out ofthe obuf register of the dsp by the clock 
signal present at pin ock (output clock). The two enable lines ctr (not 
clear to read) and cts (not clear to send) can be used to activate the 
input and output sections, respectively. A high level on one of these 
pins causes the dsp to be inactive on all the pins associated with that 
particular section, and tristates the corresponding off-chip drivers. 
This allows several dsps to be switched on and off a single external 
i/o bus. The flags ibf (input buffer full) and obe (output buffer empty) 
indicate the status of the respective buffers. These flags can be used to 
control external hardware and synchronize data transfers between the 
dsp and its peripherals. They can also be internally tested by certain 
dsp instructions. 

The dsp i/o unit is programmable via the 10-bit ioc register. This 
register configures the input and output sections of the dsp to either 
generate or accept the clock and synchronization signals. If a section 
of the dsp generates these signals, it is said to be in the ACTIVE mode; 
otherwise, it is in the PASSIVE mode. The ioc also controls the length 
of the serial data transfer to be either 8, 16, or 20 bits. In addition, the 
ioc controls the i/o clock rate for active mode. The rate can be either 
% % V32, or V64 of the dsp input clock. Finally, for both input and 
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x3gzx3dc:z)G3cex: 



SYNC 



CLOCK 



DATA 



run_n_n_n_r' . .n_n_n_ 



2- 



SYNC 



rLTLTLT" ..rLTLTL 



x=dgdgdc ::: zx^dgdc 



SYNC 



3- CLOCK 



DATA 



rLTLTLTLT ..rLTUTL 



X>3CEXVXZ : DG3GDC 



Fig. 3 — Input/output active formats. 

output, the ioc determines the timing relationship between the syn- 
chronization signal, the clock signal, and the first bit of data to be 
transferred. This is done by selecting one of four formats. Figures 3 
and 4 display the various dsp formats for the active and passive modes, 
respectively. These formats allow the dsp to be readily interfaced to a 
variety of circuits and systems. In the ACTIVE mode, the dsp gener- 
ates a burst of clock pulses whose number is a function of the selected 
format and the length of the serial transfer. In the PASSIVE mode, 
the synchronization and clock signals are supplied by an external 
source. The dsp accepts a continuous i/o clock. It should also be 
emphasized that the input and output sections are independently 
programmed except for the i/o transfer rate in the ACTIVE mode. 

V. PERFORMANCE 

The amount of signal processing that can be performed by the dsp 
depends on the cycle time t c , which is the time for basic machine 
operations, such as a multiply, or the setting of a register. Specifically, 
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3 



CLOCK 



urur^ruiT ..ji_tltl 



xiXEXEXZ : : : zxedgdc 



SYNC 
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CLOCK \ \ 



DATA 
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rNC _ J 



3- CLOCK 



DATA 



ix__n_r^TJ^"..JTJUi 



x3Gdgd(z : : : zxedgdc 



AC = 



Fig. 4 — Input/output passive formats. 

if the sampling frequency is f s (Hz), and the cycle time is t c (s), the 
number of machine cycles available (ac) per sample period is 

tc ' 

A basic machine operation requires four processor states. For a proc- 
essor state period of 200 ns (5-MHz clock), the cycle time t c is 800 ns; 
for an 8-kHz sampling rate, this provides 156 cycles per sample period. 

The capacity for basic signal processing algorithm segments is now 
simply determined. For example, a five coefficient second-order recur- 
sive filter section (Fig. 2) requires five machine cycles and, thus, the 
dsp, running at 5 MHz, can execute 31 sections at the 8-kHz sampling 
rate. In practice, of course, one cannot implement as many sections 
since there will be a variety of other tasks, such as initialization and 
i/o, which will somewhat reduce the amount of filtering in an actual 
application. 

Next, consider the size of the ROM. In most applications, each 
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machine cycle requires one word of ROM for the instruction and one 
word of rom for the coefficient. At the 8-kHz sampling rate, there are 
156 available cycles; if all of these cycles are used in the algorithm, the 
rom would need 156 + 156 = 312 words. However, in some applications, 
more than one coefficient can be associated with an instruction. An 
example is when a section of the algorithm is looped over more than 
once — each encounter of an instruction in the loop can be associated 
with a different coefficient. A study of such applications, as well as 
applications where two or more alternative programs must be resident 
in the processor at once, led to the 1024-word size for the dsp rom. 

The ram size depends on the number of data words that need to be 
stored. For the recursive structures shown in Fig. 2, two items of data 
must be stored for each second-order section (sos). Thus, if an 8-kHz 
sample rate is assumed and the maximum number of 5-multiply soss 
were programmed, then 31 X 2 = 62 words would be used. With 4- 
multiply soss, 78 words would be required. The ram size for dsp is 128 
words which is quite sufficient for the recursive filter applications. In 
the case of nonrecursive fir filters, where one needs one storage 
location for each multiplication, this ram size will allow a 128-tap 
filter. These results are summarized in Table I. 

Table I — Performance features of dsp 



16 by 20 Multiply-add (16-bit coefficient 20-bit data) 

800 ns Cycle time 

16,384-bit rom (1024 words by 16 bits/word) 

2,560-bit ram (128 words by 20 bits/word) 

31 5-multiply soss @ 8-kHz sample rate 

39 4-multiply soss @ 8-kHz sample rate 

128 fir taps @ 8-kHz sample rate 



The level of performance is such that the signal processing required 
for a dual-tone multifrequency receiver, or a low-speed fsk modem 
can be implemented in a single dsp device. 

Some further specifications of the dsp are given in Table II. 

The circuit consists of approximately 45,000 transistors (with pro- 
gram rom) within a 68.5-mm 2 chip area. 

VI. CONCLUSION 

The integrated circuit dsp described in this paper was designed not 
only to serve as a programmable processing element for general- 
Table II — Specifications for dsp 



Power supply +5 V 
Clock frequency 5 MHz 

Max. i/o serial rate 4.5 Mb/s 
Max. power dissipation 1.25 W 

Package 40-pin dip 
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purpose use in telecommunications applications, but to further en- 
hance its use in these applications by reducing development effort. 
The dsp, therefore, has not only the processing capacity and precision 
for a number of common, small applications for both voice and data 
processing, but also may be easily interfaced to system data streams 
and to other processors to realize complex algorithms. 

The dsp is the first element of a family of devices that is being 
developed at Bell Laboratories for digital signal processing in telecom- 
munication applications. 

VII. ACKNOWLEDGMENTS 

The authors wish to acknowledge the efforts of F. E. Barber, T. J. 
Bartoli, D. B. Cuttriss, R. L. Freyman, J. A. Grant, B. R. Jones, J. 
Kane, R. A. Kershaw, C. R. Miller, H. E. Nigh, N. Sachs, W. A. 
Stocker, E. F. Schweitzer, and W. Witscher, Jr. in the implementation 
of this circuit and the guidance of H. C. Kirsh, R. A. Pedersen, and D. 
C. Stanzione throughout the project. 



1462 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1 981 



